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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL; NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma strain) 
15 ORF-2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

20 ATGCGCCCTC GGCCTATTTT GTTGCTGCTC CTCATGTTTT TGCCTATGCT GCCCGCGCCA 60 

CCGCCCGGTC AGCCGTCTGG CCGCCGTCGT GGGCGGCGCA GCGGCGGTTC CGGCGGTGGT 120 

TTCTGGGGTG ACCGGGTTGA TTCTCAGCCC TTCGCAATCC CCTATATTCA TCCAACCAAC 180 

25 

CCCTTCGCCC CCGATGTCAC CGCTGCGGCC GGGGCTGGAC CTCGTGTTCG CCAACCCGCC 240 

CGACCACTCG GCTCCGCTTG GCGTGACCAG GCCCAGCGCC CCGCCGTTGC CTCACGTCGT 300 

3 0 AGACCTACCA CAGCTGGGGC CGCGCCGCTA ACCGCGGTCG CTCCGGCCCA TGACACCCCG 360 

CCAGTGCCTG ATGTCGACTC CCGCGGCGCC ATCTTGCGCC GGCAGTATAA CCTATCAACA 420 

TCTCCCCTTA CCTCTTCCGT GGCCACCGGC ACTAACCTGG TTCTTTATGC CGCCCCTCTT 480 

AGTCCGCTTT TACCCCTTCA GGACGGCACC AATACCCATA TAATGGCCAC GGAAGCTTCT 540 

AATTATGCCC AGTACCGGGT TGCCCGTGCC ACAATCCGTT ACCGCCCGCT GGTCCCCAAT 600 



40 



GCTGTCGGCG GTTACGCCAT CTCCATCTCA TTCTGGCCAC AGACCACCAC CACCCCGACG 



660 
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TCCGTTGATA TGAATTCAAT AACCTCGACG GATGTTCGTA TTTTAGTCCA GCCCGGCATA 720 

GCCTCTGAGC TTGTGATCCC AAGTGAGCGC CTACACTATC GTAACCAAGG CTGGCGCTCC 780 

GTCGAGACCT CTGGGGTGGC TGAGGAGGAG GCTACCTCTG GTCTTGTTAT GCTTTGCATA 840 

CATGGCTCAC TCGTAAATTC CTATACTAAT ACACCCTATA CCGGTGCCCT CGGGCTGTTG 900 

GACTTTGCCC TTGAGCTTGA GTTTCGCAAC CTTACCCCCG GTAACACCAA TACGCGGGTC 960 

TCCCGTTATT CCAGCACTGC TCGCCACCGC CTTCGTCGCG GTGCGGACGG GACTGCCGAG 1020 

CTCACCACCA CGGCTGCTAC CCGCTTTATG AAGGACCTCT ATTTTACTAG TACTAATGGT 1080 

15 GTCGGTGAGA TCGGCCGCGG GATAGCCCTC ACCCTGTTCA ACCTTGCTGA CACTCTGCTT 1140 

GGCGGCCTGC CGACAGAATT GATTTCGTCG GCTGGTGGCC AGCTGTTCTA CTCCCGTCCC 1200 

GTTGTCTCAG CCAATGGCGA GCCGACTGTT AAGTTGTATA CATCTGTAGA GAATGCTCAG 1260 

20 

CAGGATAAGG GTATTGCAAT CCCGCATGAC ATTGACCTCG GAGAATCTCG TGTGGTTATT * 1320 

CAGGATTATG ATAACCAACA TGAACAAGAT CGGCCGACGC CTTCTCCAGC CCCATCGCGC 1380 

2 5 CCTTTCTCTG TCCTTCGAGC TAATGATGTG CTTTGGCTCT CTCTCACCGC TGCCGAGTAT 1440 

GACCAGTCCA CTTATGGCTC TTCGACTGGC CCAGTTTATG TTTCTGACTC TGTGACCTTG 1500 

GTTAATGTTG CGACCGGCGC GCAGGCCGTT GCCCGGTCGC TCGATTGGAC CAAGGTCACA 1560 

30 

CTTGACGGTC GCCCCCTCTC CACCATCCAG CAGTACTCGA AGACCTTCTT TGTCCTGCCG 1620 

CTCCGCGGTA AGCTCTCTTT CTGGGAGGCA GGCACAACTA AAGCCGGGTA CCCTTATAAT 1680 

3 5 TATAACACCA CTGCTAGCGA CCAACTGCTT GTCGAGAATG CCGCCGGGCA CCGGGTCGCT 1740 

ATTTCCACTT ACACCACTAG CCTGGGTGCT GGTCCCGTCT CCATTTCTGC GGTTGCCGTT 1800 

TTAGCCCCCC ACTCTGCGCT AGCATTGCTT GAGGATACCT TGGACTACCC TGCCCGCGCC 1860 

CATACTTTTG ATGATTTCTG CCCAGAGTGC CGCCCCCTTG GCCTTCAGGG CTGCGCTTTC 1920 
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CAGTCTACTG TCGCTGAGCT TCAGCGCCTT AAGATGAAGG TGGGTAAAAC TCGGGAGTTG 1980 

TAGTTTATTT GCTTGTGCCC CCCTTCTTTC TGTTGCTTAT TTCTCATTTC TGCGTTCCGC 2040 

5 GCTCCCTGA 2049 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 2058 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
rfS (vi) ORIGINAL SOURCE: 

Uy 20 (C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Mexico Strain) 

ORF-2 region 



□ 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

25 ATGCGCCCTA GGCCTCTTTT GCTGTTGTTC CTCTTGTTTC TGCCTATGTT GCCCGCGCCA 60 

CCGACCGGTC AGCCGTCTGG CCGCCGTCGT GGGCGGCGCA GCGGCGGTAC CGGCGGTGGT 120 

TTCTGGGGTG ACCGGGTTGA TTCTCAGCCC TTCGCAATCC CCTATATTCA TCCAACCAAC 180 

30 

CCCTTTGCCC CAGACGTTGC CGCTGCGTCC GGGTCTGGAC CTCGCCTTCG CCAACCAGCC 240 

CGGCCACTTG GCTCCACTTG GCGAGATCAG GCCCAGCGCC CCTCCGCTGC CTCCCGTCGC 300 

3 5 CGACCTGCCA CAGCCGGGGC TGCGGCGCTG ACGGCTGTGG CGCCTGCCCA TGACACCTCA 360 

CCCGTCCCGG ACGTTGATTC TCGCGGTGCA ATTCTACGCC GCCAGTATAA TTTGTCTACT 420 

TCACCCCTGA CATCCTCTGT GGCCTCTGGC ACTAATTTAG TCCTGTATGC AGCCCCCCTT 480 

AATCCGCCTC TGCCGCTGCA GGACGGTACT AATACTCACA TTATGGCCAC AGAGGCCTCC 540 
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AATTATGCAC AGTACCGGGT TGCCCGCGCT ACTATCCGTT ACCGGCCCCT AGTGCCTAAT 600 

GCAGTTGGAG GCTATGCTAT ATCCATTTCT TTCTGGCCTC AAACAACCAC AACCCCTACA 660 

TCTGTTGACA TGAATTCCAT TACTTCCACT GATGTCAGGA TTCTTGTTCA ACCTGGCATA 720 

GCATCTGAAT TGGTCATCCC AAGCGAGCGC CTTCACTACC GCAATCAAGG TTGGCGCTCG 780 

GTTGAGACAT CTGGTGTTGC TGAGGAGGAA GCCACCTCCG GTCTTGTCAT GTTATGCATA 840 

CATGGCTCTC CAGTTAACTC CTATACCAAT ACCCCTTATA CCGGTGCCCT TGGCTTACTG 900 

GACTTTGCCT TAGAGCTTGA GTTTCGCAAT CTCACCACCT GTAACACCAA TACACGTGTG 960 

15 TCCCGTTACT CCAGCACGGC CCGTCACCGG CTCCGCCGAG GGGCCGACGG GACTGCGGAG 1020 

CTGACCACAA CTGCAGCCAC CAGGTTCATG AAAGATCTCC ACTTTACCGG CCTTAATGGG 1080 

GTAGGTGAAG TCGGCCGCGG GATAGCTCTA ACATTACTTA ACCTTGCTGA CACGCTCCTC 1140 



10 



HP 2 0 

Q GGCGGGCTCC CGACAGAATT AATTTCGTCG GCTGGCGGGC AACTGTTTTA TTCCCGCCCG 1200 

GTTGTCTCAG CCAATGGCGA GCCAACCGTG AAGCTCTATA CATCAGTGGA GAATGCTCAG 1260 

\tA 2 5 CAGGATAAGG GTGTTGCTAT CCCCCACGAT ATCGATCTTG GTGATTCGCG TGTGGTCATT 1320 

CAGGATTATG ACAACCAGCA TGAGCAGGAT CGGCCCACCC CGTCGCCTGC GCCATCTCGG 1380 

! n 

CCTTTTTCTG TTCTCCGAGC AAATGATGTA CTTTGGCTGT CCCTCACTGC AGCCGAGTAT 1440 

30 

GACCAGTCCA CTTACGGGTC GTCAACTGGC CCGGTTTATA TCTCGGACAG CGTGACTTTG 1500 

GTGAATGTTG CGACTGGCGC GCAGGCCGTA GCCCGATCGC TTGACTGGTC CAAAGTCACC 1560 

3 5 CTCGACGGGC GGCCCCTCCC GACTGTTGAG CAATATTCCA AGACATTCTT TGTGCTCCCC 1620 

CTTCGTGGCA AGCTCTCCTT TTGGGAGGCC GGCACAACAA AAGCAGGTTA TCCTTATAAT 1680 

TATAATACTA CTGCTAGTGA CCAGATTCTG ATTGAAAATG CTGCCGGCCA TCGGGTCGCC 1740 

ATTTCAACCT ATACCACCAG GCTTGGGGCC GGTCCGGTCG CCATTTCTGC GGCCGCGGTT 1800 
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GTTCGTATTT TAGTCCAGCC CGGCATAGCC TCTGAGCTTG TGATCCCAAG TGAGCGCCTA 420 

CACTATCGTA ACCAAGGCTG GCGCTCCGTC GAGACCTCTG GGGTGGCTGA GGAGGAGGCT 480 

ACCTCTGGTC TTGTTATGCT TTGCATACAT GGCTCACTCG TAAATTCCTA TACTAATACA 540 

CCCTATACCG GTGCCCTCGG GCTGTTGGAC TTTGCCCTTG AGCTTGAGTT TCGCAACCTT 600 

ACCCCCGGTA ACACCAATAC GCGGGTCTCC CGTTATTCCA GCACTGCTCG CCACCGCCTT 660 

CGTCGCGGTG CGGACGGGAC TGCCGAGCTC ACCACCACGG CTGCTACCCG CTTTATGAAG 720 

GACCTCTATT TTACTAGTAC TAATGGTGTC GGTGAGATCG GCCGCGGGAT AGCCCTCACC 780 

15 CTGTTCAACC TTGCTGACAC TCTGCTTGGC GGCCTGCCGA CAGAATTGAT TTCGTCGGCT 840 

□ 

GGTGGCCAGC TGTTCTACTC CCGTCCCGTT GTCTCAGCCA ATGGCGAGCC GACTGTTAAG 900 

CH TTGTATACAT CTGTAGAGAA TGCTCAGCAG GATAAGGGTA TTGCAATCCC GCATGACATT 960 

i fa 

2 0 

GACCTCGGAG AATCTCGTGT GGTTATTCAG GATTATGATA ACCAACATGA ACAAGATCGG 1020 

CCGACGCCTT CTCCAGCCCC ATCGCGCCCT TTCTCTGTCC TTCGAGCTAA TGATGTGCTT 1080 

25 TGGCTCTCTC TCACCGCTGC CGAGTATGAC CAGTCCACTT ATGGCTCTTC GACTGGCCCA 1140 

GTTTATGTTT CTGACTCTGT GACCTTGGTT AATGTTGCGA CCGGCGCGCA GGCCGTTGCC 1200 

CGGTCGCTCG ATTGGACCAA GGTCACACTT GACGGTCGCC CCCTCTCCAC CATCCAGCAG 1260 

30 

TACTCGAAGA CCTTCTTTGT CCTGCCGCTC CGCGGTAAGC TCTCTTTCTG GGAGGCAGGC 1320 

ACAACTAAAG CCGGGTACCC TTATAATTAT AACACCACTG CTAGCGACCA ACTGCTTGTC 1380 

3 5 GAGAATGCCG CCGGGCACCG GGTCGCTATT TCCACTTACA CCACTAGCCT GGGTGCTGGT 1440 
CCCGTCTCCA TTTCTGCGGT TGCCGTTTTA GCCCCCCACT CTGCGCTAGC ATTGCTTGAG 1500 
GATACCTTGG ACTACCCTGC CCGCGCCCAT ACTTTTGATG ATTTCTGCCC AGAGTGCCGC 1560 
CCCCTTGGCC TTCAGGGCTG CGCTTTCCAG TCTACTGTCG CTGAGCTTCA GCGCCTTAAG 1620 
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ATGAAGGTGG GTAAAACTCG GGAGTTG 1647 
(2) INFORMATION FOR SEQ ID NO: 4: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1647 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

15 (vi) ORIGINAL SOURCE: 

□ (C) INDIVIDUAL ISOLATE: Hepatitis E virus (Mexico strain) 

; '=3 r62kDa, FIGURE 2 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

GCTGTGGCGC CTGCCCATGA CACCTCACCC GTCCCGGACG TTGATTCTCG CGGTGCAATT 60 

CTACGCCGCC AGTATAATTT GTCTACTTCA CCCCTGACAT CCTCTGTGGC CTCTGGCACT 120 

25 AATTTAGTCC TGTATGCAGC CCCCCTTAAT CCGCCTCTGC CGCTGCAGGA CGGTACTAAT 180 

ACTCACATTA TGGCCACAGA GGCCTCCAAT TATGCACAGT ACCGGGTTGC CCGCGCTACT 240 

ATCCGTTACC GGCCCCTAGT GCCTAATGCA GTTGGAGGCT ATGCTATATC CATTTCTTTC 300 

30 

TGGCCTCAAA CAACCACAAC CCCTACATCT GTTGACATGA ATTCCATTAC TTCCACTGAT 360 

GTCAGGATTC TTGTTCAACC TGGCATAGCA TCTGAATTGG TCATCCCAAG CGAGCGCCTT 420 

3 5 CACTACCGCA ATCAAGGTTG GCGCTCGGTT GAGACATCTG GTGTTGCTGA GGAGGAAGCC 480 

ACCTCCGGTC TTGTCATGTT ATGCATACAT GGCTCTCCAG TTAACTCCTA TACCAATACC 540 

CCTTATACCG GTGCCCTTGG CTTACTGGAC TTTGCCTTAG AGCTTGAGTT TCGCAATCTC 600 

ACCACCTGTA ACACCAATAC ACGTGTGTCC CGTTACTCCA GCACGGCCCG TCACCGGCTC 660 
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CGCCGAGGGG CCGACGGGAC TGCGGAGCTG ACCACAACTG CAGCCACCAG GTTCATGAAA 720 

GATCTCCACT TTACCGGCCT TAATGGGGTA GGTGAAGTCG GCCGCGGGAT AGCTCTAACA 780 

5 TTACTTAACC TTGCTGACAC GCTCCTCGGC GGGCTCCCGA CAGAATTAAT TTCGTCGGCT 840 

GGCGGGCAAC TGTTTTATTC CCGCCCGGTT GTCTCAGCCA ATGGCGAGCC AACCGTGAAG 900 

CTCTATACAT CAGTGGAGAA TGCTCAGCAG GATAAGGGTG TTGCTATCCC CCACGATATC 960 

10 

GATCTTGGTG ATTCGCGTGT GGTCATTCAG GATTATGACA ACCAGCATGA GCAGGATCGG 1020 

CCCACCCCGT CGCCTGCGCC ATCTCGGCCT TTTTCTGTTC TCCGAGCAAA TGATGTACTT 1080 

15 TGGCTGTCCC TCACTGCAGC CGAGTATGAC CAGTCCACTT ACGGGTCGTC AACTGGCCCG 1140 

uQ GTTTATATCT CGGACAGCGT GACTTTGGTG AATGTTGCGA CTGGCGCGCA GGCCGTAGCC 1200 

m 

^ CGATCGCTTG ACTGGTCCAA AGTCACCCTC GACGGGCGGC CCCTCCCGAC TGTTGAGCAA 1260 

' ! i\J 

q 20 

rn TATTCCAAGA CATTCTTTGT GCTCCCCCTT CGTGGCAAGC TCTCCTTTTG GGAGGCCGGC 1320 

m 

ACAACAAAAG CAGGTTATCC TTATAATTAT AATACTACTG CTAGTGACCA GATTCTGATT 1380 

1=5 s 25 GAAAATGCTG CCGGCCATCG GGTCGCCATT TCAACCTATA CCACCAGGCT TGGGGCCGGT 1440 

Q CCGGTCGCCA TTTCTGCGGC CGCGGTTTTG GCTCCACGCT CCGCCCTGGC TCTGCTGGAG 1500 

GATACTTTTG ATTATCCGGG GCGGGCGCAC ACATTTGATG ACTTCTGCCC TGAATGCCGC 1560 

30 

GCTTTAGGCC TCCAGGGTTG TGCTTTCCAG TCAACTGTCG CTGAGCTCCA GCGCCTTAAA 1620 

GTTAAGGTGG GTAAAACTCG GGAGTTG 1647 

3 5 (2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 984 base pairs 

(B) TYPE: nucleic acid 

4 0 (C) STRANDEDNESS: double 

(D) TOPOLOGY: Linear 
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GGCCTTCAGG GCTGCGCTTT CCAGTCTACT GTCGCTGAGC TTCAGCGCCT TAAGATGAAG 960 
GTGGGTAAAA CTCGGGAGTT GTAG 984 
5 (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 984 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
15 (iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatits E Virus (Mexico strain) SG3 
region 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Q GGGGCCGACG GGACTGCGGA GCTGACCACA ACTGCAGCCA CCAGGTTCAT GAAAGATCTC 60 
25 

fUj CACTTTACCG GCCTTAATGG GGTAGGTGAA GTCGGCCGCG GGATAGCTCT AACATTACTT 120 

^ AACCTTGCTG ACACGCTCCT CGGCGGGCTC CCGACAGAAT TAATTTCGTC GGCTGGCGGG 180 

3 0 CAACTGTTTT ATTCCCGCCC GGTTGTCTCA GCCAATGGCG AGCCAACCGT GAAGCTCTAT 240 

ACATCAGTGG AGAATGCTCA GCAGGATAAG GGTGTTGCTA TCCCCCACGA TATCGATCTT 300 

GGTGATTCGC GTGTGGTCAT TCAGGATTAT GACAACCAGC ATGAGCAGGA TCGGCCCACC 360 

CCGTCGCCTG CGCCATCTCG GCCTTTTTCT GTTCTCCGAG CAAATGATGT ACTTTGGCTG 420 

TCCCTCACTG CAGCCGAGTA TGACCAGTCC ACTTACGGGT CGTCAACTGG CCCGGTTTAT 480 
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ATCTCGGACA GCGTGACTTT GGTGAATGTT GCGACTGGCG CGCAGGCCGT AGCCCGATCG 540 
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CTTGACTGGT CCAAAGTCAC CCTCGACGGG CGGCCCCTCC CGACTGTTGA GCAATATTCC 600 

AAGACATTCT TTGTGCTCCC CCTTCGTGGC AAGCTCTCCT TTTGGGAGGC CGGCACAACA 660 

AAAGCAGGTT ATCCTTATAA TTATAATACT ACTGCTAGTG ACCAGATTCT GATTGAAAAT 720 

GCTGCCGGCC ATCGGGTCGC CATTTCAACC TATACCACCA GGCTTGGGGC CGGTCCGGTC 780 

GCCATTTCTG CGGCCGCGGT TTTGGCTCCA CGCTCCGCCC TGGCTCTGCT GGAGGATACT 840 

TTTGATTATC CGGGGCGGGC GCACACATTT GATGACTTCT GCCCTGAATG CCGCGCTTTA 900 

GGCCTCCAGG GTTGTGCTTT CCAGTCAACT GTCGCTGAGC TCCAGCGCCT TAAAGTTAAG 960 

15 GTGGGTAAAA CTCGGGAGTT GTAG 984 

(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 147 base pairs 

(B) TYPE: nucleic acid 
Cfl (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



py 2 5 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 
3 0 (C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma strain) 

406,3-2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ACCTTGGACT ACCCTGCCCG CGCCCATACT TTTGATGATT TCTGCCCAGA GTGCCGCCCC 60 

CTTGGCCTTC AGGGCTGCGC TTTCCAGTCT ACTGTCGCTG AGCTTCAGCG CCTTAAGATG 120 

4 0 AAGGTGGGTA AAACTCGGGA GTTGTAG 147 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Mexico strain) 
15 406.3-2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
2 0 ACTTTTGATT ATCCGGGGCG GGCGCACACA TTTGATGACT TCTGCCCTGA ATGCCGCGCT 60 

TTAGGCCTCC AGGGTTGTGC TTTCCAGTCA ACTGTCGCTG AGCTCCAGCG CCTTAAAGTT 120 
AAGGTGGGTA AAACTCGGGA GTTGTAG 147 
(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma Strain) 
4 0 ORF-3 region 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
ATGAATAACA TGTCTTTTGC TGCGCCCATG GGTTCGCGAC CATGCGCCCT CGGCCTATTT 
TGTTGCTGCT CCTCATGTTT TTGCCTATGC TGCCCGCGCC ACCGCCCGGT CAGCCGTCTG 
GCCGCCGTCG TGGGCGGCGC AGCGGCGGTT CCGGCGGTGG TTTCTGGGGT GACCGGGTTG 
ATTCTCAGCC CTTCGCAATC CCCTATATTC ATCCAACCAA CCCCTTCGCC CCCGATGTCA 
CCGCTGCGGC CGGGGCTGGA CCTCGTGTTC GCCAACCCGC CCGACCACTC GGCTCCGCTT 
GGCGTGACCA GGCCCAGCGC CCCGCCGTTG CCTCACGTCG TAGACCTACC ACAGCTGGGG 
CCGCGCCGCT AA 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Mexico Strain) 
ORF-3 region 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ATGAATAACA TGTGGTTTGC TGCGCCCATG GGTTCGCCAC CATGCGCCCT AGGCCTCTTT 
TGCTGTTGTT CCTCTTGTTT CTGCCTATGT TGCCCGCGCC ACCGACCGGT CAGCCGTCTG 
GCCGCCGTCG TGGGCGGCGC AGCGGCGGTA CCGGCGGTGG TTTCTGGGGT GACCGGGTTG 
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ATTCTCAGCC CTTCGCAATC CCCTATATTC ATCCAACCAA CCCCTTTGCC CCAGACGTTG 240 

CCGCTGCGTC CGGGTCTGGA CCTCGCCTTC GCCAACCAGC CCGGCCACTT GGCTCCACTT 300 

GGCGAGATCA GGCCCAGCGC CCCTCCGCTG CCTCCCGTCG CCGACCTGCC ACAGCCGGGG 360 

CTGCGGCGCT GA 372 
(2) INFORMATION FOR SEQ ID NO: 11: 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma strain) 
406.4-2 region 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCCAACCCGC CCGACCACTC GGCTCCGCTT GGCGTGACCA GGCCCAGCGC CCCGCCGTTG 60 
3 0 CCTCACGTCG TAGACCTACC ACAGCTGGGG CCGCGCCGC 99 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

3 5 (A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

4 0 (ii) MOLECULE TYPE: DNA (genomic) 



117 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Mexico strain) 
5 406,4-2 region 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
10 GCCAACCAGC CCGGCCACTT GGCTCCACTT GGCGAGATCA GGCCCAGCGC CCCTCCGCTG 60 

CCTCCCGTCG CCGACCTGCC ACAGCCGGGG CTGCGGCGC 99 
(2) INFORMATION FOR SEQ ID NO: 13: 

15 

g (i) SEQUENCE CHARACTERISTICS: 

J (A) LENGTH: 660 amino acids 

'4 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

;~ 20 

: 

; fi (ii) MOLECULE TYPE: protein 

i (iii) HYPOTHETICAL: NO 

|* 2 5 (vi) ORIGINAL SOURCE: 

% (C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma strain) 

: H ORF-2 



3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Arg Pro Arg Pro lie Leu Leu Leu Leu Leu Met Phe Leu Pro Met 
15 10 15 

3 5 Leu Pro Ala Pro Pro Pro Gly Gin Pro Ser Gly Arg Arg Arg Gly Arg 

20 25 30 

Arg Ser Gly Gly Ser Gly Gly Gly Phe Trp Gly Asp Arg Val Asp Ser 
35 40 45 



40 



Gin Pro Phe Ala lie Pro Tyr lie His Pro Thr Asn Pro Phe Ala Pro 
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50 55 60 

Asp Val Thr Ala Ala Ala Gly Ala Gly Pro Arg Val Arg Gin Pro Ala 
65 70 75 80 

5 

Arg Pro Leu Gly Ser Ala Trp Arg Asp Gin Ala Gin Arg Pro Ala Val 
85 90 95 

Ala Ser Arg jArg Arg Pro Thr Thr Ala Gly Ala Ala Pro Leu Thr Ala 
10 100 105 110 

Val Ala Pro Ala His Asp Thr Pro Pro Val Pro Asp Val Asp Ser Arg 
115 120 125 

15 Gly Ala lie Leu Arg Arg Gin Tyr Asn Leu Ser Thr Ser Pro Leu Thr 

130 135 140 

'Sj Ser Ser Val Ala Thr Gly Thr Asn Leu Val Leu Tyr Ala Ala Pro Leu 

j '[J 145 150 155 160 

|j 20 

i=S Ser Pro Leu Leu Pro Leu Gin Asp Gly Thr Asn Thr His lie Met Ala 

i;n 165 170 175 

M Thr Glu Ala Ser Asn Tyr Ala Gin Tyr Arg Val Ala Arg Ala Thr lie 

I!! 5 ! 2 5 180 185 190 

!'3 i 

;;H Arg Tyr Arg Pro Leu Val Pro Asn Ala Val Gly Gly Tyr Ala lie Ser 

U 195 200 205 

3 0 He Ser Phe Trp Pro Gin Thr Thr Thr Thr Pro Thr Ser Val Asp Met 

210 215 220 

Asn Ser He Thr Ser Thr Asp Val Arg He Leu Val Gin Pro Gly He 
225 230 235 240 



Ala Ser Glu Leu Val He Pro Ser Glu Arg Leu His Tyr Arg Asn Gin 
245 250 255 
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Gly Trp Arg Ser Val Glu Thr Ser Gly Val Ala Glu Glu Glu Ala Thr 
260 265 270 
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Ser Gly Leu Val Met Leu Cys lie His Gly Ser Leu Val Asn Ser Tyr 
275 280 285 

Thr Asn Thr Pro Tyr Thr Gly Ala Leu Gly Leu Leu Asp Phe Ala Leu 
5 290 295 300 

Glu Leu Glu Phe Arg Asn Leu Thr Pro Gly Asn Thr Asn Thr Arg Val 
305 310 315 320 

10 Ser Arg Tyr Ser Ser Thr Ala Arg His Arg Leu Arg Arg Gly Ala Asp 

325 330 335 

Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg ?he Met Lys Asp 
340 345 350 

15 

Leu Tyr Phe Thr Ser Thr Asn Gly Val Gly Glu lie Gly Arg Gly lie 

;3 355 360 365 

hQ 

^ Ala Leu Thr Leu Phe Asn Leu Ala Asp Thr Leu Leu Gly Gly Leu Pro 

% 20 370 375 380 

!fl Thr Glu Leu lie Ser Ser Ala Gly Gly Gin Leu Phe Tyr Ser Arg Pro 

10 385 390 395 400 

*2 25 Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr Thr Ser Val 

= n 405 410 415 

3 Glu Asn Ala Gin Gin Asp Lys Gly lie Ala lie Pro His Asp lie Asp 

^ 420 425 430 
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Leu Gly Glu Ser Arg Val Val lie Gin Asp Tyr Asp Asn Gin His Glu 
435 440 445 



Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro Phe Ser Val 

35 450 455 460 

Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala Ala Glu Tyr 

465 470 475 480 
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Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr Val Ser Asp 
485 490 495 



Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala Val Ala Arg 
500 505 510 



Ser Leu Asp Trp Thr Lys Val Thr Leu Asp Gly Arg Pro Leu Ser Thr 
515 520 525 

lie Gin Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu Arg Gly Lys 
530 535 540 

Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr Pro Tyr Asn 
545 550 555 560 

Tyr Asn Thr Thr Ala Ser Asp Gin Leu Leu Val Glu Asn Ala Ala Gly 
565 570 575 

His Arg Val Ala lie Ser Thr Tyr Thr Thr Ser Leu Gly Ala Gly Pro 
580 585 590 

Val Ser He Ser Ala Val Ala Val Leu Ala Pro His Ser Ala Leu Ala 
595 600 605 

Leu Leu Glu Asp Thr Leu Asp Tyr Pro Ala Arg Ala His Thr Phe Asp 
610 615 620 

Asp Phe Cys Pro Glu Cys Arg Pro Leu Gly Leu Gin Gly Cys Ala Phe 
625 630 635 640 

Gin Ser Thr Val Ala Glu Leu Gin Arg Leu Lys Met Lys Val Gly Lys 
645 650 655 

Thr Arg Glu Leu 
660 

INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 660 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Mexico Strain) 
ORF-2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Arg Pro Arg Pro Leu Leu Leu Leu Phe Leu Leu Phe Leu Pro Met 
15 10 15 

Leu Pro Ala Pro Pro Thr Gly Gin Pro Ser Gly Arg Arg Arg Gly Arg 
20 25 30 

Arg Ser Gly Gly Thr Gly Gly Gly Phe Trp Gly Asp Arg Val Asp Ser 
35 40 45 

Gin Pro Phe Ala He Pro Tyr lie His Pro Thr Asn Pro Phe Ala Pro 
50 55 60 

Asp Val Ala Ala Ala Ser Gly Ser Gly Pro Arg Leu Arg Gin Pro Ala 
65 70 75 80 

Arg Pro Leu Gly Ser Thr Trp Arg Asp Gin Ala Gin Arg Pro Ser Ala 
85 90 95 

Ala Ser Arg Arg Arg Pro Ala Thr Ala Gly Ala Ala Ala Leu Thr Ala 
100 105 110 

Val Ala Pro Ala His Asp Thr Ser Pro Val Pro Asp Val Asp Ser Arg 
115 120 125 

Gly Ala lie Leu Arg Arg Gin Tyr Asn Leu Ser Thr Ser Pro Leu Thr 
130 135 140 

Ser Ser Val Ala Ser Gly Thr Asn Leu Val Leu Tyr Ala Ala Pro Leu 
145 150 155 160 



Asn Pro Pro Leu Pro Leu Gin Asp Gly Thr Asn Thr His lie Met Ala 
165 170 175 



15 



30 
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Thr Glu Ala Ser Asn Tyr Ala Gin Tyr Arg Val Ala Arg Ala Thr lie 
180 185 190 

Arg Tyr Arg Pro Leu Val Pro Asn Ala Val Gly Gly Tyr Ala lie Ser 
5 195 200 205 

lie Ser Phe Trp Pro Gin Thr Thr Thr Thr Pro Thr Ser Val Asp Met 
210 215 220 

10 Asn Ser lie Thr Ser Thr Asp Val Arg lie Leu Val Gin Pro Gly lie 

225 230 235 240 

Ala Ser Glu Leu Val lie Pro Ser Glu Arg Leu His Tyr Arg Asn Gin 
245 250 255 



Gly Trp Arg Ser Val Glu Thr Ser Gly Val Ala Glu Glu Glu Ala Thr 
260 265 270 



Ser Gly Leu Val Met Leu Cys lie His Gly Ser Pro Val Asn Ser Tyr 

20 275 280 285 

Thr Asn Thr Pro Tyr Thr Gly Ala Leu Gly Leu Leu Asp Phe Ala Leu 
290 295 300 

25 Glu Leu Glu Phe Arg Asn Leu Thr Thr Cys Asn Thr Asn Thr Arg Val 

305 310 315 320 

Ser Arg Tyr Ser Ser Thr Ala Arg His Arg Leu Arg Arg Gly Ala Asp 
325 330 335 



Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe Met Lys Asp 
340 345 350 



Leu His Phe Thr Gly Leu Asn Gly Val Gly Glu Val Gly Arg Gly lie 
35 355 360 365 

Ala Leu Thr Leu Leu Asn Leu Ala Asp Thr Leu Leu Gly Gly Leu Pro 
370 375 380 
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Thr Glu Leu He Ser Ser Ala Gly Gly Gin Leu Phe Tyr Ser Arg Pro 
385 390 395 400 
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Val Val Ser Ala Asn 
405 

Glu Asn Ala Gin Gin 
420 

Leu Gly Asp Ser Arg 
435 

Gin Asp Arg Pro Thr 



Leu Arg Ala Asn Asp 
465 

Asp Gin Ser Thr Tyr 
485 

Ser Val Thr Leu Val 
500 

Ser Leu Asp Trp Ser 
515 

Val Glu Gin Tyr Ser 
530 

Leu Ser Phe Trp Glu 
545 

Tyr Asn Thr Thr Ala 
565 

His Arg Val Ala lie 
580 

Val Ala lie Ser Ala 
595 

Leu Leu Glu Asp Thr 
610 



Gly Glu Pro Thr Val Lys 
410 

Asp Lys Gly Val Ala He 
425 

Val Val He Gin Asp Tyr 
440 

Pro Ser Pro Ala Pro Ser 

455 

Val Leu Trp Leu Ser Leu 
470 475 

Gly Ser Ser Thr Gly Pro 
490 

Asn Val Ala Thr Gly Ala 
505 

Lys Val Thr Leu Asp Gly 
520 

Lys Thr Phe Phe Val Leu 
535 

Ala Gly Thr Thr Lys Ala 
550 555 

Ser Asp Gin He Leu He 
570 

Ser Thr Tyr Thr Thr Arg 
585 

Ala Ala Val Leu Ala Pro 
600 

Phe Asp Tyr Pro Gly Arg 
615 



Leu Tyr Thr Ser Val 
415 

Pro His Asp He Asp 
430 

Asp Asn Gin His Glu 
445 

Arg Pro Phe Ser Val 
460 

Thr Ala Ala Glu Tyr 
480 

Val Tyr He Ser Asp 
495 

Gin Ala Val Ala Arg 
510 

Arg Pro Leu Pro Thr 
525 

Pro Leu Arg Gly Lys 
540 

Gly Tyr Pro Tyr Asn 
560 

Glu Asn Ala Ala Gly 
575 

Leu Gly Ala Gly Pro 
590 

Arg Ser Ala Leu Ala 
605 

Ala His Thr Phe Asp 
620 
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Asp Phe Cys Pro Glu Cys Arg Ala Leu Gly Leu Gin Gly Cys Ala Phe 
625 630 635 640 

Gin Ser Thr Val Ala Glu Leu Gin Arg Leu Lys Val Lys Val Gly Lys 
5 645 650 655 

Thr Arg Glu Leu 
660 

10 (2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

, MKT. * ' 

! t 

y (ii) MOLECULE TYPE: protean 

m 

: : 3 20 (iii) HYPOTHETICAL: NO 

!;LI (vi) ORIGINAL SOURCE: 

I.; E 

(C) INDIVIDUAL ISOLATE: Hepatitis E virus (Burma strain) 
Q r62kDa, FIGURE 4 

25 

I 1 ! I 

"IT (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Ala Val Ala Pro Ala His Asp Thr Pro Pro Val Pro Asp Val Asp Ser 
30 1 5 10 15 

Arg Gly Ala lie Leu Arg Arg Gin Tyr Asn Leu Ser Thr Ser Pro Leu 
20 25 30 

3 5 Thr Ser 'Ser Val Ala Thr Gly Thr Asn Leu Val Leu Tyr .Ala Ala Pro 

35 40 45 

Leu Ser Pro Leu Leu Pro Leu Gin Asp Gly Thr Asn Thr His lie Met 
50 55 60 
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Ala Thr Glu Ala Ser Asn Tyr Ala Gin Tyr Arg Val Ala Arg Ala Thr 
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65 70 75 80 

lie Arg Tyr Arg Pro Leu Val Pro Asn Ala Val Gly Gly Tyr Ala lie 
85 90 95 

5 

Ser lie Ser Phe Trp Pro Gin Thr Thr Thr Thr Pro Thr Ser Val Asp 
100 105 110 

Met Asn Ser lie Thr Ser Thr Asp Val Arg lie Leu Val Gin Pro Gly 
10 115 120 125 

lie Ala Ser Glu Leu Val lie Pro Ser Glu Arg Leu His Tyr Arg Asn 
130 135 140 

15 Gin Gly Trp Arg Ser Val Glu Thr Ser Gly Val Ala Glu Glu Glu Ala 

145 150 155 160 

lQ Thr Ser Gly Leu Val Met Leu Cys lie His Gly Ser Leu Val Asn Ser 

%j 165 170 175 

20 

^ Tyr Thr Asn Thr Pro Tyr Thr Gly Ala Leu Gly Leu Leu Asp Phe Ala 

180 185 190 

is Leu Glu Leu Glu Phe Arg Asn Leu Thr Pro Gly Asn Thr Asn Thr Arg 

Q 25 195 200 205 

J Val Ser Arg Tyr Ser Ser Thr Ala Arg His Arg Leu Arg Arg Gly Ala 

l*H 210 215 220 

3 0 Asp Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe Met Lys 

225 230 235 240 

Asp Leu Tyr Phe Thr Ser Thr Asn Gly Val Gly Glu He Gly Arg Gly 
245 250 255 



35 



He Ala Leu Thr Leu Phe Asn Leu Ala Asp Thr Leu Leu Gly Gly Leu 
260 265 270 
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Pro Thr Glu Leu He Ser Ser Ala Gly Gly Gin Leu Phe Tyr Ser Arg 
275 280 285 
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Pro Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr Thr Ser 
290 295 300 

Val Glu Asn Ala Gin Gin Asp Lys Gly lie Ala lie Pro His Asp lie 

5 305 310 315 320 

i 

Asp Leu Gly Glu Ser Arg Val Val lie Gin Asp Tyr Asp Asn Gin His 
325 330 335 

10 Glu Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro Phe Ser 

340 345 350 

Val Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala Ala Giu 
355 360 365 

15 

Tyr Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr Val Ser 

i;3 370 375 380 

! 3 
0 
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Asp Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala Val Ala 
I 20 385 390 395 400 

i Arg Ser Leu Asp Trp Thr Lys Val Thr Leu Asp Gly Arg Pro Leu Ser 

l 405 410 415 

3 

25 Thr lie Gin Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu Arg Gly 
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420 425 430 

Lys Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr Pro Tyr 
435 440 445 

Asn Tyr Asn Thr Thr Ala Ser Asp Gin Leu Leu Val Glu Asn Ala Ala 
450 455 460 

Gly His Arg Val Ala lie Ser Thr Tyr Thr Thr Ser Leu Gly Ala Gly 
35 465 470 475 480 

Pro Val Ser He Ser Ala Val Ala Val Leu Ala Pro His Ser Ala Leu 
485 490 495 
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Ala Leu Leu Glu Asp Thr Leu Asp Tyr Pro Ala Arg Ala His Thr Phe 
500 505 510 
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Asp Asp Phe Cys Pro Glu Cys Arg Pro Leu Gly Leu Gin Gly Cys Ala 
515 520 525 

Phe Gin Ser Thr Val Ala Glu Leu Gin Arg Leu Lys Met Lys Val Gly 
530 535 540 

Lys Thr Arg Glu Leu 
545 

) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E virus (Mexico strain) 
r62kDa, FIGURE 4 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Ala Val Ala Pro Ala His Asp Thr 
1 5 

Arg Gly Ala lie Leu Arg Arg Gin 
20 

Thr Ser Ser Val Ala Ser Gly Thr 

35 40 

Leu Asn Pro Pro Leu Pro Leu Gin 

50 55 

Ala Thr Glu Ala Ser Asn Tyr Ala 



Ser Pro Val Pro Asp Val Asp Ser 
10 15 

Tyr Asn Leu Ser Thr Ser Pro Leu 
25 30 

Asn Leu Val Leu Tyr Ala Ala Pro 
45 

Asp Gly Thr Asn Thr His lie Met 
60 

Gin Tyr Arg Val Ala Arg Ala Thr 



: I 5 
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65 70 75 80 

lie Arg Tyr Arg Pro Leu Val Pro Asn Ala Val Gly Gly Tyr Ala lie 
85 90 95 

5 

Ser lie Ser Phe Trp Pro Gin Thr Thr Thr Thr Pro Thr Ser Val Asp 
100 105 110 

Met Asn Ser lie Thr Ser Thr Asp Val Arg lie Leu Val Gin Pro Gly 
10 115 120 125 

lie Ala Ser Glu Leu Val lie Pro Ser Glu Arg Leu His Tyr Arg Asn 
130 135 140 

15 Gin Gly Trp Arg Ser Val Glu Thr Ser Gly Val Ala Glu Glu Glu Ala 

145 150 155 160 



Thr Ser Gly Leu Val Met Leu Cys lie His Gly Ser Pro Val Asn Ser 
CP 165 170 175 



HO 2 0 
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Tyr Thr Asn Thr Pro Tyr Thr Gly Ala Leu Gly Leu Leu Asp Phe Ala 
180 185 190 



p Leu Glu Leu Glu Phe Arg Asn Leu Thr Thr Cys Asn Thr Asn Thr Arg 

M 25 195 200 205 



Val Ser Arg Tyr Ser Ser Thr Ala Arg His Arg Leu Arg Arg Gly Ala 
210 215 220 

3 0 Asp Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe Met Lys 

225 230 235 240 

Asp Leu His Phe Thr Gly Leu Asn Gly Val Gly Glu Val Gly Arg Gly 
245 250 255 



lie Ala Leu Thr Leu Leu Asn Leu Ala Asp Thr Leu Leu Gly Gly Leu 
260 265 270 
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Pro Thr Glu Leu lie Ser Ser Ala Gly Gly Gin Leu Phe Tyr Ser Arg 
275 280 285 



129 

Pro Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr Thr Ser 
290 295 300 

Val Glu Asn Ala Gin Gin Asp Lys Gly Val Ala lie Pro His Asp lie 
5 305 310 315 320 

Asp Leu Gly Asp Ser Arg Val Val lie Gin Asp Tyr Asp Asn Gin His 
325 330 335 

10 Glu Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro Phe Ser 

340 345 350 

Val Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala Ala Glu 
355 360 365 

15 

Tyr Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr lie Ser 
370 375 380 

Asp Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala Val Ala 
20 385 390 395 400 



in 

=H Arg Ser Leu Asp Trp Ser Lys Val Thr Leu Asp Gly Arg Pro Leu Pro 

Jl 405 410 415 

s 

lA 25 Thr Val Glu Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu Arg Gly 

y 420 425 430 

sjwt 

-•4 Lys Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr Pro Tyr 

435 440 445 

30 

Asn Tyr Asn Thr Thr Ala Ser Asp Gin lie Leu lie Glu Asn Ala Ala 
450 455 460 

Gly His Arg Val Ala lie Ser Thr Tyr Thr Thr Arg Leu Gly Ala Gly 
35 465 470 475 480 

Pro Val Ala He Ser Ala Ala Ala Val Leu Ala Pro Arg Ser Ala Leu 
485 490 495 
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Ala Leu Leu Glu Asp Thr Phe Asp Tyr Pro Gly Arg Ala His Thr Phe 
500 505 510 
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Asp Asp Phe Cys Pro Glu Cys Arg Ala Leu Gly Leu Gin Gly Cys Ala 
515 520 525 

Phe Gin Ser Thr Val Ala Glu Leu Gin Arg Leu Lys Val Lys Val Gly 
530 535 540 

Lys Thr Arg Glu Leu 
545 

) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma strain) SG3 

region 




<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Gly Ala Asp Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe 
15 10 15 

Met Lys Asp Leu Tyr Phe Thr Ser Thr Asn Gly Val Gly Glu lie Gly 
20 25 30 

Arg Gly lie Ala Leu Thr Leu Phe Asn Leu Ala Asp Thr Leu Leu Gly 
35 40 45 

Gly Leu Pro Thr Glu Leu lie Ser Ser Ala Gly Gly Gin Leu Phe Tyr 
50 55 60 



Ser Arg Pro Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr 
65 70 75 80 
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Thr Ser Val Glu Asn 
85 

Asp lie Asp Leu Gly 
100 

Gin His Glu Gin Asp 
115 

Phe Ser Val Leu Arg 
130 

Ala Glu Tyr Asp Gin 
145 

Val Ser Asp Ser Val 
165 

Val Ala Arg Ser Leu 
180 

Leu Ser Thr lie Gin 
195 

Arg Gly Lys Leu Ser 
210, 

Pro Tyr Asn Tyr Asn 
225 

Ala Ala Gly His Arg 
245 

Ala Gly Pro Val Ser 
260 

Ala Leu Ala Leu Leu 
275 

Thr Phe Asp Asp Phe 
290 



Ala Gin Gin Asp Lys Gly 
90 

Glu Ser Arg Val Val He 
105 

Arg Pro Thr Pro Ser Pro 
120 

Ala Asn Asp Val Leu Trp 
135 

Ser Thr Tyr Gly Ser Ser 
150 155 

Thr Leu Val Asn Val Ala 
170 

Asp Trp Thr Lys Val Thr 
185 

Gin Tyr Ser Lys Thr Phe 
200 

Phe Trp Glu Ala Gly Thr 
215 

Thr Thr Ala Ser Asp Gin 
230 235 

Val Ala He Ser Thr Tyr 
250 

He Ser Ala Val Ala Val 
265 

Glu Asp Thr Leu Asp Tyr 
280 

Cys Pro Glu Cys Arg Pro 
295 



He Ala He Pro His 
95 

Gin Asp Tyr Asp Asn 
110 

Ala Pro Ser Arg Pro 
125 

Leu Ser Leu Thr Ala 
140 

Thr Gly Pro Val Tyr 
160 

Thr Gly Ala Gin Ala 
175 

Leu Asp Gly Arg Pro 
190 

Phe Val Leu Pro Leu 
205 

Thr Lys Ala Gly Tyr 
220 

Leu Leu Val Glu Asn 
240 

Thr Thr Ser Leu Gly 
255 

Leu Ala Pro His Ser 
270 

Pro Ala Arg Ala His 
285 

Leu Gly Leu Gin Gly 
300 



Cys Ala Phe Gin Ser Thr Val Ala Glu Leu Gin Arg Leu Lys Met Lys 
305 310 315 320 



Val Gly Lys Thr Arg Glu Leu 
325 

) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 amino acids 
(3) TYPE: amine acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Mexico strain) 
SG3 region 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Gly Ala Asp Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe 
15 10 15 

Met Lys Asp Leu His Phe Thr Gly Leu Asn Gly Val Gly Glu Val Gly 
20 25 30 

Arg Gly lie Ala Leu Thr Leu Leu Asn Leu Ala Asp Thr Leu Leu Gly 
35 40 45 

Gly Leu Pro Thr Glu Leu lie Ser Ser Ala Gly Gly Gin Leu Phe Tyr 
50 55 60 

Ser Arg Pro Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr 
65 70 75 80 

Thr Ser Val Glu Asn Ala Gin Gin Asp Lys Gly Val Ala lie Pro His 
85 90 95 
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Asp lie Asp Leu Gly Asp Ser Arg Val Val He Gin Asp Tyr Asp Asn 
100 105 110 

Gin His Glu Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro 
115 120 125 

Phe Ser Val Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala 
130 135 140 

Ala Glu Tyr Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr 
145 150 155 160 

He Ser Asp Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala 
165 170 175 

Val Ala Arg Ser Leu Asp Trp Ser Lys Val Thr Leu Asp Gly Arg Pro 
180 185 190 

Leu Pro Thr Val Glu Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu 
195 200 205 

Arg Gly Lys Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr 
210 215 220 

Pro Tyr Asn Tyr Asn Thr Thr Ala Ser Asp Gin He Leu He Glu Asn 
225 230 235 240 

Ala Ala Gly His Arg Val Ala He 'Ser Thr Tyr Thr Thr Arg Leu Gly 
245 250 255 

Ala Gly Pro Val Ala He Ser Ala Ala Ala Val Leu Ala Pro Arg Ser 
260 265 270 

Ala Leu Ala Leu Leu Glu Asp Thr Phe Asp Tyr Pro Gly Arg Ala His 
275 280 285 

Thr Phe Asp Asp Phe Cys Pro Glu Cys Arg Ala Leu Gly Leu Gin Gly 
290 295 300 



Cys Ala Phe Gin Ser Thr Val Ala Glu Leu Gin Arg Leu Lys Val Lys 
305 310 315 320 
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Val Gly Lys Thr Arg Glu Leu 



325 



INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma strain) 

406.4-2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Ala Asn Pro Pro Asp His Ser Ala Pro Leu Gly Val Thr Arg Pro Ser 

15 10 15 

Ala Pro Pro Leu Pro His Val Val Asp Leu Pro Gin Leu Gly Pro Arg 



INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



20 



25 



30 



Arg 



(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Mexico strain) 
406.4-2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Ala Asn Gin Pro Gly His Leu Ala Pro Leu Gly Glu lie Arg Pro Ser 
15 10 15 

Ala Pro Pro Leu Pro Pro Val Ala Asp Leu Pro Gin Pro Gly Leu Arg 
20 25 30 

Arg 

) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma Strain) 

ORF-3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

Met Asn Asn Met Ser Phe Ala Ala Pro Met Gly Ser Arg Pro Cys Ala 
15 10 15 

Leu Gly Leu Phe Cys Cys Cys Ser Ser Cys Phe Cys Leu Cys Cys Pro 
20 25 30 



Arg His Arg Pro Val Ser Arg Leu Ala Ala Val Val Gly Gly Ala Ala 
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35 40 45 

Ala Val Pro Ala Val Val Ser Gly Val Thr Gly Leu lie Leu Ser Pro 
50 55 60 

5 

Ser Gin Ser Pro lie Phe lie Gin Pro Thr Pro Ser Pro Pro Met Ser 
65 70 75 80 

Pro Leu Arg Pro Gly Leu Asp Leu Val Phe Ala Asn Pro Pro Asp His 
10 85 90 95 

Ser Ala Pro Leu Gly Val Thr Arg Pro Ser Ala Pro Pro Leu Pro His 
100 105 110 

15 Val Val Asp Leu Pro Gin Leu Gly Pro Arg Arg Glx 

115 120 



M (2) INFORMATION FOR SEQ ID NO: 22: 

,SS3. 

!.s i 

2 0 (i) SEQUENCE CHARACTERISTICS: 

Q (A) LENGTH: 124 amino acids 

CP (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

f][ 25 (ii) MOLECULE TYPE: protein 

ru 

*P (iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

3 0 (C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Mexico Strain) 

ORF-3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

35 

Met Asn Asn Met Trp Phe Ala Ala Pro Met Gly Ser Pro Pro Cys Ala 
15 10 15 
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Leu Gly Leu Phe Cys Cys Cys Ser Ser Cys Phe Cys Leu Cys Cys Pro 
20 25 30 



Arg His Arg Pro Val Ser Arg Leu Ala Ala Val Val Gly Gly Ala Ala 
35 40 45 



Ala Val Pro Ala Val Val Ser Gly Val Thr Gly Leu lie Leu Ser Pro 

50 55 60 

Ser Gin Ser Pro lie Phe lie Gin Pro Thr Pro Leu Pro Gin Thr Leu 
65 70 75 80 



Pro Leu Arg Pro Gly Leu Asp Leu Ala Phe Ala Asn Gin Pro Gly His 
85 90 95 

Leu Ala Pro Leu Gly Glu lie Arg Pro Ser Ala Pro Pro Leu Pro Pro 
100 105 110 

Val Ala Asp Leu Pro Gin Pro Gly Leu Arg Arg Glx 
115 120 

) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma strain) 

406.3-2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Thr Leu Asp Tyr Pro Ala Arg Ala His Thr Phe Asp Asp Phe Cys Pro 
15 10 15 

Glu Cys Arg Pro Leu Gly Leu Gin Gly Cys Ala Phe Gin Ser Thr Val 
20 25 30 
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Ala Glu Leu Gin Arg Leu Lys Met Lys Val Gly Lys Thr Arg Glu Leu 
35 40 45 



a 



5 (2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 



15 



! 1' 20 



25 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Mexico strain) 
406.3-2 region 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Thr Phe Asp Tyr Pro Gly Arg Ala His Thr Phe Asp Asp Phe Cys Pro 
15 10 15 

Glu Cys Arg Ala Leu Gly Leu Gin Gly Cys Ala Phe Gin Ser Thr Val 
20 25 30 

Ala Glu Leu Gin Arg Leu Lys Val Lys Val Gly Lys Thr Arg Glu Leu 
30 35 40 45 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 540 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



40 



(ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

. (C) INDIVIDUAL ISOLATE: Hepatitis E virus (Burma strain) 
r62kDa, 58.1 kDa 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Ala Val Ala Pro Ala His Asp Thr Pro Pro Val Pro Asp Val Asp Ser 
15 10 15 

Arg Gly Ala lie Leu Arg Arg Gin Tyr Asn Leu Ser Thr Ser Pro Leu 
20 25 30 

Thr Ser Ser Val Ala Thr Gly Thr Asn Leu Val Leu Tyr Ala Ala Pro 
35 40 45 

Leu Ser Pro Leu Leu Pro Leu Gin Asp Gly Thr Asn Thr His lie Met 
50 55 60 

Ala Thr Glu Ala Ser Asn Tyr Ala Gin Tyr Arg Val Ala Arg Ala Thr 
65 70 75 80 

lie Arg Tyr Arg Pro Leu Val Pro Asn Ala Val Gly Gly Tyr Ala lie 
85 90 95 

Ser lie Ser Phe Trp Pro Gin Thr Thr Thr Thr Pro Thr Ser Val Asp 
100 105 110 

Met Asn Ser lie Thr Ser Thr Asp Val Arg lie Leu Val Gin Pro Gly 
115 120 125 

lie Ala Ser Glu Leu Val lie Pro Ser Glu Arg Leu His Tyr Arg Asn 
130 135 140 

Gin Gly Trp Arg Ser Val Glu Thr Ser Gly Val Ala Glu Glu Glu Ala 
145 150 155 160 



Thr Ser Gly Leu Val Met Leu Cys lie His Gly Ser Leu Val Asn Ser 
165 170 175 
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Tyr Thr Asn Thr Pro Tyr Thr Gly Ala Leu Gly Leu Leu Asp Phe Ala 
180 185 190 

Leu Glu Leu Glu Phe Arg Asn Leu Thr Pro Gly Asn Thr Asn Thr Arg 
195 200 205 

Val Ser Arg Tyr Ser Ser Thr Ala Arg His Arg Leu Arg Arg Gly Ala 
210 215 220 

Asp Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe Met Lys 

225 230 235 240 

Asp Leu Tyr Phe Thr Ser Thr Asn Gly Val Gly Glu lie Gly Arg Gly 
245 250 255 

lie Ala Leu Thr Leu Phe Asn Leu Ala Asp Thr Leu Leu Gly Gly Leu 
260 265 270 

Pro Thr Glu Leu lie Ser Ser Ala Gly Gly Gin Leu Phe Tyr Ser Arg 
275 280 285 

Pro Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr Thr Ser 
290 295 300 

Val Glu Asn Ala Gin Gin Asp Lys Gly lie Ala lie Pro His Asp lie 
305 310 315 320 

Asp Leu Gly Glu Ser Arg Val Val lie Gin Asp Tyr Asp Asn Gin His 
325 330 335 

Glu Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro Phe Ser 
340 345 350 

Val Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala Ala Glu 
355 360 365 

Tyr Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr Val Ser 
370 375 380 



Asp Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala Val Ala 
385 390 395 400 
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Arg Ser Leu Asp Trp Thr Lys Val Thr Leu Asp Gly Arg Pro Leu Ser 
405 410 415 

Thr lie Gin Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu Arg Gly 
5 420 425 430 

Lys Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr Pro Tyr 
435 440 445 

10 Asn Tyr Asn Thr Thr Ala Ser Asp Gin Leu Leu Val Glu Asn Ala Ala 

450 455 460 



Gly His Arg Val Ala lie Ser Thr Tyr Thr Thr Ser Leu Gly Ala Gly 
465 470 475 480 

15 

Pro Val Ser lie Ser Ala Val Ala Val Leu Ala Pro His Ser Ala Leu 
l 'i 485 490 495 

M 

ffi Ala Leu Leu Glu Asp Thr Leu Asp Tyr Pro Ala Arg Ala His Thr Phe 

; J3 20 500 505 510 

i'jpi 

Asp Asp Phe Cys Pro Glu Cys Arg Pro Leu Gly Leu Gin Gly Cys Ala 
^ 515 520 525 

\^ 25 Phe Gin Ser Thr Val Ala Glu Leu Gin Arg Leu Lys 

TU 530 1 535 540 



(2) INFORMATION FOR SEQ ID NO: 26: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: NO 



40 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E virus (Mexico strain) 
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r62kDa, 58.1 kDa 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Ala Val Ala Pro Ala His Asp Thr Ser Pro Val Pro Asp Val Asp Ser 
15 10 15 

Arg Gly Ala He Leu Arg Arg Gin Tyr Asn Leu Ser Thr Ser Pro Leu 
20 25 30 

Thr Ser Ser Val Ala Ser Gly Thr Asn Leu Val Leu Tyr Ala Ala Pro 
35 40 45 

Leu Asn Pro Pro Leu Pro Leu Gin Asp Gly Thr Asn Thr His He Met 
50 55 60 

Ala Thr Glu Ala Ser Asn Tyr Ala Gin Tyr Arg Val Ala Arg Ala Thr 
65 70 75 80 

He Arg Tyr Arg Pro Leu Val Pro Asn Ala Val Gly Gly Tyr Ala He 
85 90 95 

Ser He Ser Phe Trp Pro Gin Thr Thr Thr Thr Pro Thr Ser Val Asp 
100 105 110 

Met Asn Ser He Thr Ser Thr Asp Val Arg He Leu Val Gin Pro Gly 
115 120 125 

He Ala Ser Glu Leu Val He Pro Ser Glu Arg Leu His Tyr Arg Asn 
130 135 140 

Gin Gly Trp Arg Ser Val Glu Thr Ser Gly Val Ala Glu Glu Glu Ala 
145 150 155 160 

Thr Ser Gly Leu Val Met Leu Cys He His Gly Ser Pro Val Asn Ser 
165 170 175 

Tyr Thr Asn Thr Pro Tyr Thr Gly Ala Leu Gly Leu Leu Asp Phe Ala 
180 185 190 

Leu Glu Leu Glu Phe Arg Asn Leu Thr Thr Cys Asn Thr Asn Thr Arg 
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195 200 205 

Val Ser Arg Tyr Ser Ser Thr Ala Arg His Arg Leu Arg Arg Gly Ala 
210 215 220 

5 

Asp Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe Met Lys 
225 230 235 240 

Asp Leu His Phe Thr Gly Leu Asn Gly Val Gly Glu Val Gly Arg Gly 
10 245 250 255 

lie Ala Leu Thr Leu Leu Asn Leu Ala Asp Thr Leu Leu Gly Gly Leu 
260 265 270 

15 Pro Thr Glu Leu lie Ser Ser Ala Gly Gly Gin Leu Phe Tyr Ser Arg 

275 280 285 

hJ 

Pro Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr Thr Ser 
290 295 300 

3 20 

LJ Val Glu Asn Ala Gin Gin Asp Lys Gly Val Ala lie Pro His Asp lie 

CP 305 310 315 320 

m 

!L Asp Leu Gly Asp Ser Arg Val Val lie Gin Asp Tyr Asp Asn Gin His 

\2 25 325 330 335 

: =p Glu Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro Phe Ser 

□ 340 345 350 

3 0 Val Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala Ala Glu 

355 360 365 

Tyr Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr He Ser 
370 375 380 



35 



Asp Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala Val Ala 
385 390 395 400 
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Arg Ser Leu Asp Trp Ser Lys Val Thr Leu Asp Gly Arg Pro Leu Pro 
405 410 415 
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Thr Val Glu Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu Arg Gly 
420 425 430 

Lys Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr Pro Tyr 
435 440 445 

Asn Tyr Asn Thr Thr Ala Ser Asp Gin lie Leu lie Glu Asn Ala Ala 
450 455 460 

Gly His Arg Val Ala lie Ser Thr Tyr Thr Thr Arg Leu Gly Ala Gly 
465 470 475 480 

Pro Val Ala lie Ser Ala Ala Ala Val Leu Ala Pro Arg Ser Ala Leu 
485 490 495 

Ala Leu Leu Glu Asp Thr Phe Asp Tyr Pro Gly Arg Ala His Thr Phe 
500 505 510 

Asp Asp Phe Cys Pro Glu Cys Arg Ala Leu Gly Leu Gin Gly Cys Ala 
515 520 525 

Phe Gin Ser Thr Val Ala Glu Leu Gin Arg Leu Lys 
530 535 540 

INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



(vi) 



ORIGINAL SOURCE: 
(C) INDIVIDUAL ISOLATE: Hepatitis E virus (Burma strain) 
r62kDa, 5.6,5 kDa 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Ala Val Ala Pro Ala His Asp Thr Pro Pro Val Pro Asp Val Asp Ser 
15 10 15 

Arg Gly Ala He Leu Arg Arg Gin Tyr Asn Leu Ser Thr Ser Pro Leu 
20 25 30 

Thr Ser Ser Val Ala Thr Gly Thr Asn Leu Val Leu Tyr Ala Ala Pro 
35 40 45 

Leu Ser Pro Leu Leu Pro Leu Gin Asp Gly Thr Asn Thr His He Met 
50 55 60 

Ala Thr Glu Ala Ser Asn Tyr Ala Gin Tyr Arg Val Ala Arg Ala Thr 
65 70 75 80 

He Arg Tyr Arg Pro Leu Val Pro Asn Ala Val Gly Gly Tyr Ala He 
85 90 95 

Ser He Ser Phe Trp Pro Gin Thr Thr Thr Thr Pro Thr Ser Val Asp 
100 105 HO 

Met Asn Ser He Thr Ser Thr Asp Val Arg He Leu Val Gin Pro Gly 
115 120 125 

He Ala Ser Glu Leu Val He Pro Ser Glu Arg Leu His Tyr Arg Asn 
130 135 140 

Gin Gly Trp Arg Ser Val Glu Thr Ser Gly Val Ala Glu Glu Glu Ala 
145 150 155 160 

Thr Ser Gly Leu Val Met Leu Cys He His Gly Ser Leu Val Asn Ser 
165 170 175 

Tyr Thr Asn Thr Pro Tyr Thr Gly Ala Leu Gly Leu Leu Asp Phe Ala 
180 185 190 



Leu Glu Leu Glu Phe Arg Asn Leu Thr Pro Gly Asn Thr Asn Thr Arg 
195 200 205 
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Val Ser Arg Tyr Ser Ser Thr Ala Arg His Arg Leu Arg Arg Gly Ala 
210 215 220 

Asp Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe Met Lys 
225 230 235 240 

Asp Leu Tyr Phe Thr Ser Thr Asn Gly Val Gly Glu lie Gly Arg Gly 
245 250 255 

lie Ala Leu Thr Leu Phe Asn Leu Ala Asp Thr Leu Leu Gly Gly Leu 
260 265 270 

Pro Thr Glu Leu lie Ser Ser Ala Gly Gly Gin Leu Phe Tyr Ser Arg 
275 280 285 

Pro Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr Thr Ser 
290 295 300 

Val Glu Asn Ala Gin Gin Asp Lys Gly He Ala He Pro His Asp He 
305 310 315 320 

Asp Leu Gly Glu Ser Arg Val Val He Gin Asp Tyr Asp Asn Gin His 
325 330 335 

Glu Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro Phe Ser 
340 345 350 

Val Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala Ala Glu 
355 360 365 

Tyr Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr Val Ser 
370 375 380 

Asp Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala Val Ala 
385 390 395 400 

Arg Ser Leu Asp Trp Thr Lys Val Thr Leu Asp Gly Arg Pro Leu Ser 
405 410 415 



Thr He Gin Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu Arg Gly 
420 425 430 



• ... • 
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Lya Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr Pro Tyr 
435 440 445 

Asn Tyr Asn Thr Thr Ala Ser Asp Gin Leu Leu Val Glu Asn Ala Ala 
450 455 460 

Gly His Arg Val Ala lie Ser Thr Tyr Thr Thr Ser Leu Gly Ala Gly 
465 470 475 480 

Pro Val Ser lie Ser Ala Val Ala Val Leu Ala Pro His Ser Ala Leu 
485 490 495 

Ala Leu Leu Glu Asp Thr Leu Asp Tyr Pro Ala Arg Ala His Thr Phe 
500 505 510 

Asp Asp Phe Cys Pro Glu Cys Arg Pro Leu Gly Leu Gin 
515 520 525 

INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E virus (Mexico strain) 
r62kDa, 56.5 kDa 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Ala Val Ala Pro Ala His Asp Thr Ser Pro Val Pro Asp Val Asp Ser 
1.5 10 15 



Arg Gly Ala lie Leu Arg Arg Gin Tyr Asn Leu Ser Thr Ser Pro Leu 
20 25 30 
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Thr Ser Ser Val Ala Ser Gly Thr Asn Leu Val Leu Tyr Ala Ala Pro 
35 40 45 

Leu Asn Pro Pro Leu Pro Leu Gin Asp Gly Thr Asn Thr His lie Met 
50 55 60 

Ala Thr Glu Ala Ser Asn Tyr Ala Gin Tyr Arg Val Ala Arg Ala Thr 
65 70 75 80 

lie Arg Tyr Arg Pro Leu Val Pro Asn Ala Val Gly Gly Tyr Ala He 
85 90 95 

Ser He Ser Phe Trp Pro Gin Thr Thr Thr Thr Pro Thr Ser Val Asp 
100 105 110 

Met Asn Ser He Thr Ser Thr Asp Val Arg He Leu Val Gin Pro Gly 
115 120 125 

He Ala Ser Glu Leu Val He Pro Ser Glu Arg Leu His Tyr Arg Asn 
130 135 140 

Gin Gly Trp Arg Ser Val Glu Thr Ser Gly Val Ala Glu Glu Glu Ala 
145 150 155 160 

Thr Ser Gly Leu Val Met Leu Cys He His Gly Ser Pro Val Asn Ser 
165 170 175 

Tyr Thr Asn Thr Pro Tyr Thr Gly Ala Leu Gly Leu Leu Asp Phe Ala 
180 185 190 

Leu Glu Leu Glu Phe Arg Asn Leu Thr Thr Cys Asn Thr Asn Thr Arg 
195 200 205 

Val Ser Arg Tyr Ser Ser Thr Ala Arg His Arg Leu Arg Arg Gly Ala 
210 215 220 

Asp Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe Met Lys 
225 230 235 240 



Asp Leu His Phe Thr Gly Leu Asn Gly Val Gly Glu Val Gly Arg Gly 
245 250 255 
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lie Ala Leu Thr Leu Leu Asn Leu Ala Asp Thr Leu Leu Gly Gly Leu 
260 265 270 

Pro Thr Glu Leu lie Ser Ser Ala Gly Gly Gin Leu Phe Tyr Ser Arg 
275 280 285 

Pro Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr Thr Ser 
290 295 300 

Val Glu Asn Ala Gin Gin Asp Lys Gly Val Ala lie Pro His Asp lie 
305 310 315 320 

Asp Leu Gly Asp Ser Arg Val Val He Gin Asp Tyr Asp Asn Gin His 
325 330 335 

Glu Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro Phe Ser 
340 345 350 

Val Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala Ala Glu 
355 360 365 

Tyr Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr He Ser 
370 375 380 

Asp Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala Val Ala 
385 390 395 400 

Arg Ser Leu Asp Trp Ser Lys Val Thr Leu Asp Gly Arg Pro Leu Pro 
405 410 415 

Thr Val Glu Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu Arg Gly 
420 425 430 

Lys Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr Pro Tyr 
435 440 445 

Asn Tyr Asn Thr Thr Ala Ser Asp Gin He Leu He Glu Asn Ala Ala 
450 455 460 

Gly His Arg Val Ala He Ser Thr Tyr Thr Thr Arg Leu Gly Ala Gly 
465 470 475 480 



i:n 



25 



30 



40 
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Pro Val Ala He Ser Ala Ala Ala Val Leu Ala Pro Arg Ser Ala Leu 
485 490 495 

Ala Leu Leu Glu Asp Thr Phe Asp Tyr Pro Gly Arg Ala His Thr Phe 
5 500 505 510 

Asp Asp Phe Cys Pro Glu Cys Arg Ala Leu Gly Leu Gin 
515 520 525 

10 (2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 
20 (iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: HEV 5' primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GGGGGGGATC CATATGGCGG TCGCTCCGGC CCATGACACC CCG 43 

(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
3 5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: HEV 3' primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
ATTAGAAGCT TCCGTGGCCA TTATATG 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: internally consistent sequence 

two HEV 406.4-2 antigens 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "where Xaa is Q or P" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "where Xaa is G or D" 

( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "where Xaa is L or S" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /note= "where Xaa is E or V" 

( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /note= "where Xaa is I or T" 

( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 22 

(D) OTHER INFORMATION: /note= "where Xaa is P or H" 

( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 24 

(D) OTHER INFORMATION: /note= "where Xaa is A or V" 

( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 29 

(D) OTHER INFORMATION: /note= "where Xaa is P or L" 

( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 31 

(D) OTHER INFORMATION: /note= "where Xaa is L or P" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Ala Asn Xaa Pro Xaa His Xaa Ala Pro Leu Gly Xaa Xaa Arg Pro Ser 
15 10 15 

Ala Pro Pro Leu Pro Xaa Val Xaa Asp Leu Pro Gin Xaa Gly Xaa Arg 
20 25 30 



Arg 



